Abstract. Let $t \ge 1$ and let $n, M$ be natural numbers, $n < M$. Let $A = (a_{i,j})$ be an $n \times M$ matrix whose rows are orthonormal. Suppose that for all $j$

$$\sqrt{\frac{M}{n}} \cdot \Big( \sum_{i=1}^{n} a_{i,j}^2 \Big)^{1/2} \le t.$$

Using majorizing measure estimates we prove that for every $\varepsilon > 0$ there exists a set $I \subset \{1, \dots, M\}$ of cardinality at most

$$C \cdot \frac{t^2}{\varepsilon^2} \cdot n \cdot \log n$$

so that for all $x \in \mathbb{R}^n$

$$(1-\varepsilon) \cdot \|x\| \le \sqrt{\frac{M}{|I|}} \cdot \|R_I A^T x\| \le (1+\varepsilon) \cdot \|x\|.$$

Here $R_I : \mathbb{R}^M \to \mathbb{R}^M$ is the orthogonal projection onto the space $\mathrm{span}\{e_i \mid i \in I\}$, where $\{e_i\}_{i=1}^{M}$ is the standard basis of $\mathbb{R}^M$.



1. Introduction 

We consider the following problem, posed by B. Kashin and L. Tzafriri [K-T]: Let $\varepsilon > 0$ and let $n, M$ be natural numbers, $n < M$. Given an $n \times M$ matrix $A$ whose rows are orthonormal, what is the smallest cardinality $L(A,\varepsilon)$ of a subset $I \subset \{1, \dots, M\}$ so that for all $x \in \mathbb{R}^n$

(1.1) $$(1-\varepsilon) \cdot \|x\| \le \sqrt{\frac{M}{|I|}} \cdot \|R_I A^T x\| \le (1+\varepsilon) \cdot \|x\| \, ?$$

Here $R_I : \mathbb{R}^M \to \mathbb{R}^M$ is the orthogonal projection onto the space $\mathrm{span}\{e_i \mid i \in I\}$, where $\{e_i\}_{i=1}^{M}$ is the standard basis of $\mathbb{R}^M$. Throughout this paper we denote by $\|\cdot\|$ the standard $\ell_2$-norm and by $|I|$ the cardinality of a set $I$.
Under the additional assumption that all the entries of $A$ have the same absolute value $1/\sqrt{M}$, Kashin and Tzafriri proved that

(1.2) $$L(A,\varepsilon) \le \frac{c}{\varepsilon^4} \cdot n^2 \log n.$$

Moreover, their proof shows that a random subset $I$ of this cardinality satisfies (1.1) with probability close to 1. Clearly, the estimate (1.2) is not optimal. The example of random selection of columns of a rectangular Walsh matrix, considered by Kashin and Tzafriri, suggests that the possible upper bound could be

(1.3) $$L(A,\varepsilon) \le C(\varepsilon) \cdot n \log n.$$

On the other hand, simple examples ([K-T], [R]) show that the estimate (1.3) is the best one can obtain by the random selection method.
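The random selection of columns of a rectangular Walsh matrix can be explored numerically. The following is a minimal sketch, not taken from [K-T]: it assumes the Sylvester construction for the Walsh rows, and the helper names `walsh_rows`, `gram_deviation` and `random_subset` are hypothetical. Instead of the operator norm it computes the Frobenius norm $\delta$ of $\frac{M}{|I|} A R_I A^T - \mathrm{Id}$, which is only an upper bound, so that $(1-\delta)\|x\|^2 \le \frac{M}{|I|}\|R_I A^T x\|^2 \le (1+\delta)\|x\|^2$.

```python
import math
import random


def walsh_rows(n, log2_m):
    """First n rows of the M x M Sylvester-Hadamard matrix, scaled by 1/sqrt(M)."""
    m = 1 << log2_m
    scale = 1.0 / math.sqrt(m)
    # Entry (i, j) of the Sylvester matrix is (-1)^(number of common 1-bits of i and j).
    return [[scale * (-1) ** bin(i & j).count("1") for j in range(m)]
            for i in range(n)]


def gram_deviation(a, subset):
    """Frobenius norm of (M/|I|) * A R_I A^T - Id (an upper bound for the operator norm)."""
    n, m = len(a), len(a[0])
    factor = m / len(subset)
    total = 0.0
    for p in range(n):
        for q in range(n):
            g = factor * sum(a[p][i] * a[q][i] for i in subset)
            total += (g - (1.0 if p == q else 0.0)) ** 2
    return math.sqrt(total)


def random_subset(m, size, seed=0):
    """A uniformly random subset of {0, ..., m-1} of the given size (seeded)."""
    return sorted(random.Random(seed).sample(range(m), size))
```

For instance, `gram_deviation(walsh_rows(4, 6), random_subset(64, 32))` measures how far a random half of the columns is from satisfying (1.1); taking all 64 columns gives deviation zero, since the rows are orthonormal.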

As mentioned in [R], the Kashin and Tzafriri problem is dual to that of finding an approximate John's decomposition. Entropy estimates used in [R] for the latter problem made it possible to improve (1.2). More precisely, let $t \ge 1$ and suppose that the matrix $A$ satisfies

$$\sqrt{\frac{M}{n}} \cdot \Big( \sum_{i=1}^{n} a_{i,j}^2 \Big)^{1/2} \le t$$

for all $j = 1, \dots, M$. Then

$$L(A,\varepsilon) \le C(\varepsilon) \cdot t^2 \cdot n \log^3 n.$$
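The parameter $t$ is easy to evaluate for a concrete matrix. The sketch below assumes the hypothesis bounds the Euclidean norms of the columns, that is $\sqrt{M/n} \cdot (\sum_{i} a_{i,j}^2)^{1/2} \le t$ for every $j$, and returns the smallest admissible $t$. The function name `column_parameter` and the two sample matrices are illustrative choices, not objects from the paper.

```python
import math


def column_parameter(a):
    """Smallest t with sqrt(M/n) * (sum_i a[i][j]^2)^(1/2) <= t for every column j."""
    n, m = len(a), len(a[0])
    max_col_norm = max(math.sqrt(sum(a[i][j] ** 2 for i in range(n)))
                       for j in range(m))
    return math.sqrt(m / n) * max_col_norm


# Both matrices have orthonormal rows (n = 2, M = 4).
flat = [[0.5, 0.5, 0.5, 0.5],
        [0.5, -0.5, 0.5, -0.5]]      # all entries have modulus 1/sqrt(M)
spiky = [[1.0, 0.0, 0.0, 0.0],
         [0.0, 1.0, 0.0, 0.0]]       # the mass sits in two columns
```

For `flat` the value is 1 up to rounding, the smallest possible since the squared column norms of a matrix with orthonormal rows average to $n/M$; for `spiky` it is $\sqrt{2}$.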

In order to improve this estimate one can use majorizing measures instead of entropy estimates. The method of majorizing measures, developed by Talagrand ([L-T], [T1]), is extremely useful in obtaining estimates of stochastic processes related to random selection. A random process similar to that arising in the Kashin and Tzafriri problem was considered by Talagrand [T2] for the problem of embedding a finite dimensional subspace of $L_p$ into $\ell_p^N$. For this kind of process Talagrand introduced a special method of constructing majorizing measures. This method ($s$-separated trees) can be used to prove the estimate

$$L(A,\varepsilon) \le C(\varepsilon) \cdot t^2 \cdot n \log n \cdot (\log \log n)^2$$

for the Kashin and Tzafriri problem. It is unlikely that the $(\log \log n)^2$ factor can be removed by a modification of the $s$-separated trees method. However, using a different approach based on the explicit construction of a partition tree, we obtained a sharper estimate. More precisely, we prove the following

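Theorem. Let $t \ge 1$ and let $A = (a_{i,j})$ be an $n \times M$ matrix whose rows are orthonormal. Suppose that for all $j$

(1.4) $$\sqrt{\frac{M}{n}} \cdot \Big( \sum_{i=1}^{n} a_{i,j}^2 \Big)^{1/2} \le t.$$

Then for every $\varepsilon > 0$ there exists a set $I \subset \{1, \dots, M\}$ so that

(1.5) $$|I| \le C \cdot \frac{t^2}{\varepsilon^2} \cdot n \cdot \log n$$

and for all $x \in \mathbb{R}^n$

(1.6) $$(1-\varepsilon) \cdot \|x\| \le \sqrt{\frac{M}{|I|}} \cdot \|R_I A^T x\| \le (1+\varepsilon) \cdot \|x\|.$$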



Throughout this paper C, c etc. denote absolute constants whose value may 
change from line to line. 

The main part of the proof is the proof of Lemma 1 below. Our original proof of this lemma used a direct construction of the majorizing measure. It included an explicit construction of a sequence of partitions and an assignment of weights to the elements of each partition. This scheme is based on Talagrand and Zinn's proof of the majorizing measure theorem of Fernique (Proposition 2.3 and Theorem 2.5 [T4]). The proof was rather involved, since we had to approximate the natural metric of a random process by a family of metrics depending on the elements of the partition. After we had shown our proof to M. Talagrand, he pointed out that the explicit construction of the partition tree may be replaced by an application of his general majorizing measure construction (Theorems 4.2, 4.3 and Proposition 4.4 [T4]). This resulted in a considerable simplification of the proof. We present here the argument suggested by Talagrand.

By the duality between the Kashin and Tzafriri problem and approximate John's 
decompositions, we have the following 

Corollary. Let $B$ be a convex body in $\mathbb{R}^n$ and let $\varepsilon > 0$. There exists a convex body $K \subset \mathbb{R}^n$, so that $d(K,B) \le 1 + \varepsilon$ and the number of contact points of $K$ with its John ellipsoid is less than

$$m(n,\varepsilon) = C(\varepsilon) \cdot n \cdot \log n.$$

2. The random selection method

Clearly, we may assume that $M \ge C \cdot \frac{t^2}{\varepsilon^2} \cdot n \cdot \log n$ for some absolute constant $C$.

The proof of the Theorem is based on the following iteration procedure. Let $A = (a_{i,j})$ be an $n \times M$ matrix satisfying (1.4). We define a sequence $\{\varepsilon_i\}_{i=1}^{M}$ of independent Bernoulli variables taking values $\pm 1$ with probability $1/2$ and put

$$I_1 = \{ i \mid \varepsilon_i = 1 \}.$$

Then

(2.1)

with probability at least 1/4. Define



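$$W = A^T \mathbb{R}^n$$

and denote by $w(1), \dots, w(M)$ the coordinates of a vector $w$. We have to estimate

$$\sup_{x \in B_2^n} \Big| 2 \, \|R_{I_1} A^T x\|^2 - \|x\|^2 \Big| = \sup_{w \in W \cap B_2^M} \Big| 2 \cdot \sum_{i \in I_1} w^2(i) - \sum_{i=1}^{M} w^2(i) \Big| = \sup_{w \in W \cap B_2^M} \Big| \sum_{i=1}^{M} \varepsilon_i w^2(i) \Big|.$$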
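The passage from the selected set $I_1 = \{i \mid \varepsilon_i = 1\}$ to the signed sum rests on the pointwise identity $2\sum_{i \in I_1} w^2(i) - \sum_{i=1}^{M} w^2(i) = \sum_{i=1}^{M} \varepsilon_i w^2(i)$. The short sketch below checks it numerically on random data; the function name `selector_identity_gap` and the sample sizes are illustrative assumptions.

```python
import random


def selector_identity_gap(w, signs):
    """|(2 * sum_{i in I_1} w_i^2 - sum_i w_i^2) - sum_i eps_i * w_i^2| for I_1 = {i : eps_i = 1}."""
    i1 = [i for i, e in enumerate(signs) if e == 1]
    lhs = 2.0 * sum(w[i] ** 2 for i in i1) - sum(x ** 2 for x in w)
    rhs = sum(e * x ** 2 for e, x in zip(signs, w))
    return abs(lhs - rhs)


rng = random.Random(1)
w = [rng.uniform(-1.0, 1.0) for _ in range(50)]     # arbitrary coordinates w(1), ..., w(M)
signs = [rng.choice((-1, 1)) for _ in range(50)]    # Bernoulli signs eps_1, ..., eps_M
gap = selector_identity_gap(w, signs)               # zero up to floating-point rounding
```

The identity holds because each index in $I_1$ contributes $2w^2(i) - w^2(i) = w^2(i)$ and each index outside $I_1$ contributes $-w^2(i)$.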



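Denote by $EX$ the expectation of a random variable $X$. The key step of the proof is the following

Lemma 1. Let $W$ be an $n$-dimensional subspace of $\mathbb{R}^M$. Let $\varepsilon_1, \dots, \varepsilon_M$ be independent Bernoulli variables taking values $\pm 1$ with probability $1/2$. Then

$$E \sup_{w \in W \cap B_2^M} \Big| \sum_{i=1}^{M} \varepsilon_i w^2(i) \Big| \le C \sqrt{\log M} \cdot \|P_W : \ell_1^M \to \ell_2^M\|.$$

Here $P_W : \mathbb{R}^M \to \mathbb{R}^M$ is the orthogonal projection onto $W$.

From (1.4) it follows that

$$\|P_W : \ell_1^M \to \ell_2^M\| \le t \cdot \sqrt{\frac{n}{M}},$$

so by Lemma 1 and Chebychev's inequality we have

(2.2) $$\sup_{x \in B_2^n} \Big| 2 \, \|R_{I_1} A^T x\|^2 - \|x\|^2 \Big| \le C \cdot t \cdot \sqrt{\frac{n}{M}} \cdot \sqrt{\log M}$$

with probability more than 3/4. Thus, there exists a set $I_1 \subset \{1, \dots, M\}$ satisfying (2.1) and (2.2).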

Repeating this procedure, we obtain a sequence of sets $\{1, \dots, M\} = I_0 \supset I_1 \supset \dots \supset I_s$ so that



(2.3) 



and 



1 - 



1 



< I < 



141 

2 



(2.4) $$\sup_{x \in B_2^n} \Big| 2^k \, \|R_{I_k} A^T x\|^2 - 2^{k-1} \, \|R_{I_{k-1}} A^T x\|^2 \Big| \le C \cdot t \cdot \sqrt{\frac{n}{M/2^k}} \cdot \sqrt{\log |I_{k-1}|}.$$

Indeed, at each step of induction we have

(2.5) $$\frac{1}{2} \, \|x\|^2 \le 2^{k-1} \, \|R_{I_{k-1}} A^T x\|^2 \le \frac{3}{2} \, \|x\|^2.$$



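Assume for simplicity that $I_{k-1} = \{1, \dots, m\}$ for some $m \le M$. Let $W_k = R_{I_{k-1}} A^T \mathbb{R}^n \subset \mathbb{R}^m$ and let $P_{W_k} : \mathbb{R}^m \to \mathbb{R}^m$ be the orthogonal projection onto $W_k$. Then, by (2.5),

$$2^{\frac{k-1}{2}} \, R_{I_{k-1}} A^T B_2^n \subset \sqrt{\tfrac{3}{2}} \cdot (B_2^m \cap W_k),$$

so for a random set $I_k \subset \{1, \dots, m\}$ we have

$$E \sup_{x \in B_2^n} \Big| 2^k \, \|R_{I_k} A^T x\|^2 - 2^{k-1} \, \|R_{I_{k-1}} A^T x\|^2 \Big| \le E \sup_{w \in \sqrt{3/2} \, (B_2^m \cap W_k)} \Big| \sum_{i=1}^{m} \varepsilon_i w^2(i) \Big| \le \frac{3}{2} \cdot E \sup_{w \in B_2^m \cap W_k} \Big| \sum_{i=1}^{m} \varepsilon_i w^2(i) \Big|.$$

To apply Lemma 1 we need to estimate $\|P_{W_k} : \ell_1^m \to \ell_2^m\|$. By (2.5) we have

$$\|P_{W_k} : \ell_1^m \to \ell_2^m\| \le 2^{\frac{k+1}{2}} \cdot t \cdot \sqrt{\frac{n}{M}}.$$

Now (2.4) follows from Lemma 1 and Chebychev's inequality.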
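Summing up inequalities (2.4) we get

(2.6) $$\sup_{x \in B_2^n} \Big| 2^s \, \|R_{I_s} A^T x\|^2 - \|x\|^2 \Big| \le c \cdot t \cdot \sum_{k=1}^{s} \sqrt{\frac{n}{M/2^k}} \cdot \sqrt{\log |I_{k-1}|} \le C \cdot t \cdot \sqrt{\frac{n}{M/2^s} \cdot \log \frac{M}{2^s}}.$$

We proceed until the last expression is greater than $\varepsilon/2$. In this case

$$c \cdot \frac{t^2}{\varepsilon^2} \cdot n \cdot \log n \le \frac{M}{2^s} \le C \cdot \frac{t^2}{\varepsilon^2} \cdot n \cdot \log n.$$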



From (2.3) it follows that 



M 
2 1 



1 - 



< \h\ < 



M 



so we obtain (1.5) and

Then, (2.6) implies that

$$\sup_{x \in B_2^n} \Big| \frac{M}{|I_s|} \cdot \|R_{I_s} A^T x\|^2 - \|x\|^2 \Big| \le \varepsilon$$

and this completes the proof of the Theorem. □

Remark. The random selection method was first used by Talagrand [T3] to simplify the construction of an embedding of a finite dimensional subspace of $L_1$ into $\ell_1^N$. The original construction of Bourgain, Lindenstrauss and Milman used the empirical distribution method instead. The advantage of random selection is that it allows one to deal with random processes having a subgaussian tail estimate, rather than with general Bernoulli processes.

3. Construction of the majorizing measure

The proof of Lemma 1 uses the majorizing measure theorem of Talagrand [T1], [T4]. This theorem provides a bound for

$$E \sup_{t \in T} X_t$$

for a subgaussian process $X_t$ indexed by points of a metric space $T$ with a metric $d$ through the geometry of this space. However, it turns out that the space $T$ does not have to be assumed metric. The same proof works in the case when $d$ is a quasimetric, i.e. if there exists a constant $A$ such that for any $t, \bar t, s \in T$

$$d(t, \bar t) \le A \cdot \big( d(t,s) + d(s, \bar t) \big).$$

We use the following version of

Majorizing measure theorem. Let $(T, d)$ be a quasimetric space. Let $(X_t)_{t \in T}$ be a collection of mean 0 random variables with the subgaussian tail estimate

$$P\{ |X_t - X_{\bar t}| > a \} \le \exp\Big( -c \, \frac{a^2}{d^2(t, \bar t)} \Big)$$

for all $a > 0$. Let $r > 1$ and let $k_0$ be a natural number so that the diameter of $T$ is less than $r^{-k_0}$. Let $\{\varphi_k\}_{k=k_0}^{\infty}$ be a sequence of functions from $T$ to $\mathbb{R}_+$, uniformly bounded by a constant depending only on $r$. Assume that there exists $\sigma > 0$ so that for any $k$ the functions $\varphi_k$ satisfy the following condition: for any $s \in T$ and for any points $t_1, \dots, t_N \in B_{r^{-k}}(s)$ with mutual distances at least $r^{-k-1}$ one has

(3.1) $$\max_{j=1,\dots,N} \varphi_{k+2}(t_j) \ge \varphi_k(s) + \sigma \cdot r^{-k} \cdot \sqrt{\log N}.$$

Then

$$E \sup_{t \in T} X_t \le C(r) \cdot \sigma^{-1}.$$

This version may be obtained as a combination of the majorizing measure theorem of Fernique [L-T] and the general majorizing measure construction of Talagrand (Theorems 2.1 and 2.2 [T1] or Theorems 4.2, 4.3 and Proposition 4.4 [T4]).

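To prove Lemma 1 we need some estimates of covering numbers. Denote by $N(B,d,\varepsilon)$ the $\varepsilon$-entropy of $B$, i.e. the number of $\varepsilon$-balls in the (quasi-)metric $d$ needed to cover the body $B$. We use the following

Lemma 2. Let $W$ be an $n$-dimensional subspace of $\mathbb{R}^M$ and let $P_W$ be the orthogonal projection onto $W$.

(1) $$\varepsilon \sqrt{\log N(B_2^M \cap W, \|\cdot\|_\infty, \varepsilon)} \le C \cdot \|P_W : \ell_1^M \to \ell_2^M\| \cdot \sqrt{\log M}.$$

(2) Let $\|\cdot\|_{\mathcal{E}}$ be a norm defined by

$$\|x\|_{\mathcal{E}} = \Big( \sum_{i=1}^{M} x^2(i) \, a_i^2 \Big)^{1/2}.$$

Then

$$\varepsilon \sqrt{\log N(B_2^M \cap W, \|\cdot\|_{\mathcal{E}}, \varepsilon)} \le C \cdot \|P_W : \ell_1^M \to \ell_2^M\| \cdot \Big( \sum_{i=1}^{M} a_i^2 \Big)^{1/2}.$$

Proof. Both statements follow from the dual Sudakov minoration [L-T].

(1) Let $g$ be the standard Gaussian vector in $\mathbb{R}^M$. Then $P_W g$ is the standard Gaussian vector in the space $W$. So,

$$\varepsilon \sqrt{\log N(B_2^M \cap W, \|\cdot\|_\infty, \varepsilon)} \le C \cdot E \|P_W g\|_\infty = C \cdot E \max_{j=1,\dots,M} |\langle P_W g, e_j \rangle| \le C \cdot \sqrt{\log M} \cdot \max_{j=1,\dots,M} \|P_W e_j\| = C \cdot \sqrt{\log M} \cdot \|P_W : \ell_1^M \to \ell_2^M\|.$$

(2) Again dual Sudakov minoration gives

$$\varepsilon \sqrt{\log N(B_2^M \cap W, \|\cdot\|_{\mathcal{E}}, \varepsilon)} \le C \cdot E \|P_W g\|_{\mathcal{E}} \le C \cdot \big( E \|P_W g\|_{\mathcal{E}}^2 \big)^{1/2} = C \cdot \Big( \sum_{i=1}^{M} a_i^2 \, \|P_W e_i\|^2 \Big)^{1/2} \le C \cdot \|P_W : \ell_1^M \to \ell_2^M\| \cdot \Big( \sum_{i=1}^{M} a_i^2 \Big)^{1/2}.$$ □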

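Proof of Lemma 1. Denote

$$W_1 = B_2^M \cap W.$$

We have to estimate the expectation of the supremum over all $w \in W_1$ of a random process

$$V_w = \sum_{i=1}^{M} \varepsilon_i w^2(i).$$

The process $V_w$ has a subgaussian tail estimate

$$P\{ V_w - V_{\bar w} > a \} \le \exp\Big( -c \, \frac{a^2}{\tilde d^2(w, \bar w)} \Big),$$

where

$$\tilde d(w, \bar w) = \Big( \sum_{i=1}^{M} \big( w^2(i) - \bar w^2(i) \big)^2 \Big)^{1/2}.$$

We shall estimate the metric $\tilde d$ by a quasimetric, which is simpler to control. Put

$$d(w, \bar w) = \Big( \sum_{i=1}^{M} \big( w(i) - \bar w(i) \big)^2 \cdot \big( w^2(i) + \bar w^2(i) \big) \Big)^{1/2},$$

so that $\tilde d(w, \bar w) \le \sqrt{2} \cdot d(w, \bar w)$. Since

$$d(w, \bar w) \le \frac{1}{\sqrt{2}} \cdot \Big( \tilde d(w, \bar w) + \|w - \bar w\|_{\ell_4^M}^2 \Big) \le \sqrt{2} \cdot d(w, \bar w),$$

we have a generalized triangle inequality for $d$. Namely, for all $u, w, \bar w \in W$

(3.2) $$d(w, \bar w) \le 4 \cdot \big( d(w,u) + d(u, \bar w) \big).$$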
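The generalized triangle inequality (3.2) can be probed numerically. The sketch below assumes the quasimetric has the form $d(w,\bar w) = \big(\sum_i (w(i)-\bar w(i))^2 (w^2(i)+\bar w^2(i))\big)^{1/2}$, which is the form used in the proof of Lemma 3; the function names and the sampling scheme are illustrative. It samples random triples and records the largest observed ratio $d(w,\bar w) / (d(w,u) + d(u,\bar w))$.

```python
import math
import random


def quasi_d(w, v):
    """Assumed quasimetric: (sum_i (w_i - v_i)^2 * (w_i^2 + v_i^2))^(1/2)."""
    return math.sqrt(sum((a - b) ** 2 * (a * a + b * b) for a, b in zip(w, v)))


def worst_triangle_ratio(trials=2000, dim=8, seed=0):
    """Largest observed d(w, v) / (d(w, u) + d(u, v)) over random triples in [-1, 1]^dim."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        w, u, v = ([rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(3))
        denom = quasi_d(w, u) + quasi_d(u, v)
        if denom > 0.0:
            worst = max(worst, quasi_d(w, v) / denom)
    return worst
```

A random search of this kind cannot prove (3.2); it only confirms that no sampled triple violates the constant 4.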

The balls in the quasimetric $d$ are not convex. However, we have the following

Lemma 3. For all $w \in W$ and $\rho > 0$

$$\mathrm{conv} \, B_\rho(w) \subset B_{4\rho}(w).$$

Here we denote by $B_\rho(w)$ a $\rho$-ball in the quasimetric $d$.
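A simple consequence of Lemma 3 is that the midpoint of two points of $B_\rho(w)$ lies in $B_{4\rho}(w)$. The sketch below tests this special case on random data, again assuming the quasimetric $d(w,\bar w) = \big(\sum_i (w(i)-\bar w(i))^2 (w^2(i)+\bar w^2(i))\big)^{1/2}$; the helper names are hypothetical, and $\rho$ is taken to be the larger of the two distances to $w$.

```python
import math
import random


def quasi_d(w, v):
    """Assumed quasimetric: (sum_i (w_i - v_i)^2 * (w_i^2 + v_i^2))^(1/2)."""
    return math.sqrt(sum((a - b) ** 2 * (a * a + b * b) for a, b in zip(w, v)))


def worst_midpoint_ratio(trials=2000, dim=6, seed=0):
    """Largest observed d(midpoint, w) / rho, where rho = max(d(u1, w), d(u2, w))."""
    rng = random.Random(seed)
    worst = 0.0
    for _ in range(trials):
        w, u1, u2 = ([rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(3))
        rho = max(quasi_d(u1, w), quasi_d(u2, w))
        mid = [(a + b) / 2.0 for a, b in zip(u1, u2)]   # a point of conv B_rho(w)
        if rho > 0.0:
            worst = max(worst, quasi_d(mid, w) / rho)
    return worst
```

By Lemma 3 the returned value can never exceed 4; values above 1 would exhibit the non-convexity of the balls.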
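Proof. Note that for all $u \in B_\rho(w)$

$$\Big( \sum_{i=1}^{M} \big( u(i) - w(i) \big)^2 \, w^2(i) \Big)^{1/2} \le \rho$$

and

$$\Big( \sum_{i=1}^{M} \big( u(i) - w(i) \big)^4 \Big)^{1/4} \le (\sqrt{2} \rho)^{1/2}.$$

Both left-hand sides are convex functions of $u$, so the same inequalities hold also for all $u \in \mathrm{conv} \, B_\rho(w)$. Since for all $a, b \in \mathbb{R}$, $a^2 + b^2 \le 4a^2 + 2(a-b)^2$, for any $u \in \mathrm{conv} \, B_\rho(w)$ we have

$$d(u,w) \le \Big( \sum_{i=1}^{M} \big( u(i) - w(i) \big)^2 \cdot \big( 4 w^2(i) + 2 (u(i) - w(i))^2 \big) \Big)^{1/2} \le 2 \cdot \Big( \sum_{i=1}^{M} \big( u(i) - w(i) \big)^2 \cdot w^2(i) \Big)^{1/2} + \sqrt{2} \cdot \Big( \sum_{i=1}^{M} \big( u(i) - w(i) \big)^4 \Big)^{1/2} \le 4\rho.$$ □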

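Denote

$$Q = \|P_W : \ell_1^M \to \ell_2^M\|.$$

Let now $r$ be a natural number to be chosen later. Let $k_0$ and $k_1$ be the largest natural numbers so that

$$r^{-k_0} \ge \mathrm{diam}(W_1, \|\cdot\|_\infty) = Q \quad \text{and} \quad r^{-k_1} \ge \frac{Q}{\sqrt{n}}.$$

Then $k_1 - k_0 \le (2 \log r)^{-1} \log n$.

Define functions $\varphi_k : W_1 \to \mathbb{R}$ by

$$\varphi_k(w) = \min\{ \|u\|^2 \mid u \in \mathrm{conv} \, B_{2r^{-k}}(w) \} + \frac{k - k_0}{\log M}, \quad \text{if } k = k_0, \dots, k_1,$$

$$\varphi_k(w) = 1 + \frac{1}{2 \log r} + \sum_{l=k_1}^{k} r^{-l} \cdot \frac{\sqrt{n \cdot \log(1 + 2\sqrt{2} \, r^l)}}{Q \cdot \sqrt{\log M}}, \quad \text{if } k > k_1.$$

For any $w \in W_1$ the sequence $\{\varphi_k(w)\}_{k=k_0}^{\infty}$ is nonnegative, nondecreasing and bounded by a constant depending only on $r$. Indeed, if $k \le k_1$ then

$$\varphi_k(w) \le 1 + \frac{k_1 - k_0}{\log M} \le 1 + \frac{\log n}{2 \log r \cdot \log M} \le 1 + \frac{1}{2 \log r}.$$

For $k > k_1$ we have

$$\varphi_k(w) \le 1 + \frac{1}{2 \log r} + \sum_{l=k_1}^{\infty} r^{-l} \cdot \frac{\sqrt{n \cdot \log(1 + 2\sqrt{2} \, r^l)}}{Q \cdot \sqrt{\log M}} \le 1 + \frac{1}{2 \log r} + c(r) \cdot r^{-k_1} \cdot \frac{\sqrt{n}}{Q} \cdot \frac{\sqrt{\log(1 + 2\sqrt{2} \, r^{k_1})}}{\sqrt{\log M}} \le C(r).$$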

To prove Lemma 1 we have to show that condition (3.1) holds for $\{\varphi_k(w)\}_{k=k_0}^{\infty}$ with the constant $\sigma = (c \cdot Q \cdot \sqrt{\log M})^{-1}$. Let $x \in W_1$ and suppose that the points $x_1, \dots, x_N \in B_{r^{-k}}(x)$ satisfy

$$d(x_j, x_l) \ge r^{-k-1} \quad \text{for all } j \ne l.$$

For $k \ge k_1 - 1$ condition (3.1) follows from the simple volume estimate

$$N \le N(W_1, d, r^{-k-1}) \le N(W_1, \sqrt{2} \, \|\cdot\|, r^{-k-1}) \le N\Big( W_1, \|\cdot\|, \frac{r^{-k-1}}{\sqrt{2}} \Big) \le \big( 1 + 2\sqrt{2} \, r^{k+1} \big)^n.$$

Suppose now that $k_0 \le k < k_1 - 1$. For $j = 1, \dots, N$ denote by $z_j$ the point of $\mathrm{conv} \, B_{2r^{-k-2}}(x_j)$ for which the minimum of $\|z\|$ is attained and denote by $u$ the similar point of $\mathrm{conv} \, B_{2r^{-k}}(x)$. By (3.2) and Lemma 3 we have for all $j \ne l$

$$d(x_j, x_l) \le 16 \cdot \big( d(x_j, z_j) + d(z_j, z_l) + d(z_l, x_l) \big) \le 16 \cdot \big( 16 \cdot r^{-k-2} + d(z_j, z_l) \big),$$

so, $d(z_j, z_l) \ge \frac{1}{32} r^{-k-1}$ if $r \ge 512$. Under the same assumption on $r$ we have
d(zj, x) < 4:^d(zj, Xj) + d(xj, 



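Denote

$$\theta = \max_{j=1,\dots,N} \|z_j\|^2 - \|u\|^2.$$

We have to prove that

(3.3) $$r^{-k} \cdot \big( c \cdot Q \cdot \sqrt{\log M} \big)^{-1} \cdot \sqrt{\log N} \le \max_{j=1,\dots,N} \varphi_{k+2}(x_j) - \varphi_k(x) = \theta + \frac{2}{\log M}.$$

Since $\frac{z_j + u}{2} \in \mathrm{conv} \, B_{2r^{-k}}(x)$ and $\|u\| \le \big\| \frac{z_j + u}{2} \big\| \le \|z_j\|$, we have

$$\Big\| \frac{z_j - u}{2} \Big\|^2 = \frac{1}{2} \|z_j\|^2 + \frac{1}{2} \|u\|^2 - \Big\| \frac{z_j + u}{2} \Big\|^2 \le \|z_j\|^2 - \|u\|^2,$$

so,

(3.4) $$\|z_j - u\| \le 2 \sqrt{\theta}.$$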

Thus, N is bounded by the ir~ fc_1 -entropy of the set K = u + 2\/~6B?f fl W in 
the quasimetric d. To estimate this entropy we partition the set K into S disjoint 
subsets having diameter less than j^r~ k ~ l 6~ 1 / 2 in the metric. By part (1) of 
Lemma 2 we may assume that 



(3.5) 



_L r -fc-i . e~ 1/2 y/iogs <c-q- Ve^iogM. 



If $S \ge \sqrt{N}$, we are done, because in this case (3.5) implies (3.3). Suppose that $S < \sqrt{N}$. Then there exists an element of the partition containing at least $\sqrt{N}$ points $z_j$. Let $J \subset \{1, \dots, N\}$ be the set of the indices of these points. We have

(3-6) Iki-^lloo^^"*" 1 ^" 172 
for all j, Z G J, j 7^ I. Since d(zj, z\) > ^r~ k ~ 1 , we have 

1 \ 2 M 2 

-r- fc -M <£(*,■(<) -*i(0) .(z 2 (i) + z?(i))< 

M 2 



(3.7) 



i=l 



4« 2 (0 + ^(0-i {4 i l , jl > 2|u(4)l} (0 + ^(0-i,..| l .,.,,^ l x(0 



-{i\\ Zl \>2\u(i)\} 
Then (3.4) implies

M 

(3-8) Y z 2 Ai)-l r |, , , ,(i) < 169 

V ' ^ jK ' U \\zi >2ui) \ w - 



{i\\ Zj \>2\u(i)\} 
i=l 



- 8 



Combining (3.6) and (3.8) we get that (3.7) is bounded by 

//Q-l/2 \ 2 M . x2 

2-160- (— g— r-*-M + 4^ (*,-(<) -*,(<)) '^CO- 

^ ' i=l 

Thus, for all j,l e J, j ^ i we have 

' M 2 \ ^ 2 1 

£(zi(0-*i(0) -« 2 (0 

Then part (2) of Lemma 2 implies 

i , / M \ 1/2 

-r- fc - 1 v ^pT< cVe-Q- f XXCOj <cVe-Q. 

Since for all > 

2v^< v/lolM -0+^=1==, 

VlogM 

we get 

l r - fc - 1 v ^oiiV< ir- fc - 1 v ^pT<C.Q- v ^iM.^+ i _l^^. □ 